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Abstract 

^ ■ This paper studies the problem of relay-assisted user scheduling for downlink wireless trans- 

mission. The base station or access point employs hybrid automatic-repeat-request (HARQ) 
with the assistance of a set of fixed relays to serve a set of mobile users. By minimizing a cost 
function of the queue lengths at the base station and the number of retransmissions of the head- 
of-line packet for each user, the base station can schedule an appropriate user in each time slot 
and an appropriate transmitter to serve it. It is shown that a priority-index policy is optimal for 
a linear cost function with packets arriving according to a Poisson process and for an increasing 
convex cost function where packets must be drained from the queues at the base station. 

Keywords - Relays, scheduling policies, hybrid automatic-repeat-request, priority- index rules. 



1 Introduction 



Relay-assisted communication will increase system throughput and coverage in local and metropoli- 
tan area networks [1]. The focus of this paper is on quantifying the scheduling-related benefits of 
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relay-assisted communication. For downlink transmission, this raises the question of how the presence 
of fixed relays impacts the user scheduling decisions at the base station or access point. Relay-assisted 
scheduling has been studied in a variety of network!] [2-5]. 

The benefits of relaying are enhanced by intelligent reception strategies such as HARQ [6]. Most 
of the prior work on HARQ-based scheduling does not account for the potential benefits of assist- 
ing relays, though. Relays can be employed to decrease the number of HARQ transmissions that 
are required to serve a particular user. Given a HARQ transmission framework, the relay-assisted 
scheduling problem is not merely a function of user queue lengths at the base station [2]. It is now 
important to also consider the number of HARQ transmissions that have occurred for each user's 
head-of-line (HoL) packet, which directly impacts the decoding delay that each user incurs. Decod- 
ing delay also depends on queue lengths [7] and has influenced work in the scheduling domain [8]. 

In this paper we derive cost-minimizing user scheduling policies for a relay-based modification of 
the HARQ model in [9]. In [9], packets arrive at the base station, and in each time slot one of the users 
is scheduled. Each user has a cost function that depends on its queue length at the base station and 
the number of retransmissions that have occurred for its HoL packet. The objective is to schedule a 
user to minimize the long-term average expected cost. It is shown in [9] that the optimal scheduler is 
a fixed priority-index policy [10], where an index, or "priority," is calculated for each user. The users 
are ranked according to their priority indices, and the highest-ranked user with a nonempty queue is 
serviced in that time slot. One example of a priority index is the ratio of the storage cost at the base 
station for a given user's packet to the expected time before that user decodes that packet [9]. 

Since the analysis in [9] does not consider the presence of relays, it is unclear as to whether 
a priority-index policy is optimal for our relay-assisted system, as the priority index for each user 
can be adjusted if at least one relay has previously decoded its HoL packet. To address this issue, 
we consider relay- assisted variants of the two problems in [9]. One problem entails minimizing a 
linear cost function with Poisson arrivals at the base station (LPA), while the other problem entails 
minimizing an increasing convex function of queue length without any new arrivals at the base station 

^^Note that as the number of relays increases, communication between the base station and the relays becomes 
more challenging. In particular, the level of signaling overhead increases, and intelligent frequency reuse planning for 
multi-relay transmission becomes more difficult. 
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(DC). We prove that the optimal scheduler for the relay-assisted variants of the LPA and DC problems 
is actually a priority- index policy as in [9]. 

2 System Model 

First, we introduce the notation used throughout the paper. \z\ 2 denotes the absolute square of a 
complex number z. E denotes the expectation operator. 

Consider the system in Fig. (TJ which consists of a single base station, M fixed relays, and N 
users. Packets for each user arrive at the base station, and each packet is placed in a queue for its 
intended user. The packet arrival processes are mutually independent. Let h^ n denote the channel 
between nodes i and n. In each time slot, the base station selects a packet, which is then transmitted 
by the base station or a selected relay. This relay must have previously decoded that packet. If the 
scheduled user decodes the packet, the base station flushes it from its queue, and each relay removes 
the packet from its memory. If the scheduled user cannot decode the packet, though, it remains in 
its queue at the base station. Each relay also retains the packet in its memory, and the base station 
may either select that packet or a packet intended for another user for the next time slot. 

When a packet is transmitted, its intended mobile target and all of the relays that have yet to 
decode it employ a generic HARQ decoding strategy. For example, each of these receiving nodes 
can use maximal-ratio combining of successive transmissions of this packet, with the objective of 
improving its decoding probability. 

We assume that before each time slot, the base station knows its channel gain \h tt i\ 2 to each user % 
and the channel gain \h a ^ from each relay a to each user i. This is reasonable for a cellular network 
with a relatively small number of users N and relays M. As N and/or M grow large, though, the level 
of signaling overhead would become prohibitive for proper network operation. We also assume that 
time is slotted, and each channel gain |/ij jn | 2 remains constant over a single HARQ retransmission 
sequence, which consists of a finite number of slots. This assumption is reasonable in a slow fading 
environment. Each channel gain |/ij jn | 2 also varies independently from one HARQ retransmission 
sequence to the next, which is a block fading assumption. 



3 



Paper: J4-TVT, Second Revision, First Draft, May 16, 2009 



Given the transmission model, the relays are only considered in the scheduling policy when a 
selected user fails to decode its transmitted packet, requiring its future retransmission. Each relay 
can store one packet for each user, where 1) this packet has been transmitted by the base station and 
2) its intended mobile target failed to decode it. Thus, the packet arrival process at each relay can 
be described as follows: assuming that there is at least one nonempty queue at the base station, one 
packet arrives in each time slot. The packet arrival probability for a given user is the probability of 
that user being scheduled by the base station in that time slot. 

Let S(n) from [9] be the state vector for the base station at time slot n. Thus, S(n) includes the 
number of transmission attempts for the current HoL packet for each user and the queue length for 
each user. Also, let S a (n) = {R ajl (n) : R at2 (n), . . . ,R a ^ N (n)} be the state vector for relay a at time 
slot n, where R a ,i(n) = 1 if relay a has decoded the HoL packet for user i, but user % has not decoded 
it. Otherwise, R a ,i(n) = 0. Let M = {BS, 1,2,..., M} denote the set of allowed transmitters. Our 
objective is to design a scheduling policy n R e 11^ such that n R (S(n), Si(n), S 2 (n), • • • , Sm(^)) = 
(i, a), where transmitter a G M. serves the scheduled user i. 



3 Relay- Assisted Linear Poisson Arrivals Problem 

We consider the relay-assisted linear Poisson arrivals (RLPA) problem, which is a variant of the 
LPA problem in [9]. In the LPA problem, packets arrive at their corresponding queues at the base 
station. The arrival process of the packets for user % is Poisson with rate \. Let c itTi denote the cost 
of storing a packet for user i that has already been transmitted r\ times. The base station computes 
a cost function C/j for user i that both depends on r\ and is linear in the queue length Xj(n), where 



Q, (^(n)-1) + Ci H 0£(n) Xi (n)>0 
U i {x i {n),r i (n)) = { (1) 

xAn) = 



< a r . < c. ', Ti < r, 

implying that the storage cost is a nondecreasing function of the number of transmission attempts. 
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The RLPA problem entails determining the scheduling policy ttr £ Hr that minimizes the long-run 
average expected cost 



Jrlpa = lim -E T 



N 



n=l i=l 



The optimal policy for the RLPA problem is based on the fixed priority- index policy that is 
optimal for the LPA problem [9]. For the RLPA problem, knowledge of {Si(n), S 2 (n), . . . , Sjvf(n)} at 
the base station is useful in deciding which users can be served more quickly than others. Note that 
cost increases with the number of retransmission attempts and the incurred delay. 

Theorem 1. The optimal scheduling policy for the RLPA problem is a priority-index rule, where the 
HoL packet with the highest priority index over all nonempty base station queues is selected. The 
transmitter that yields the highest priority index transmits the selected HoL packet. 

Proof. The proof is in Appendix [A] □ 

We now provide an intuitive justification of Theorem [U In [9], each user % is assigned a fixed 
priority index where c i t h l is the holding-cost rate for the HoL packet of user i that 

5 i 5 i ' i 

has undergone rf oL transmission attempts and < rf oL < v™ 10 -*. Also, T i r HoL is the expected service 
time for the HoL packet of user i, where 

T iirfoi = l+ II 9i(l) (2) 

u =r HoL i =r HoL 

i i 

and Qiil) is the probability of a decoding failure by user i given that its HoL packet has been trans- 
mitted I times. The optimal policy from [9] is to schedule the user with the highest priority index. 
To simplify the following discussion, we have not considered the impact of relaying in (j2j). 

A lower value of T h l implies that user i achieves a higher priority index. Now consider the 
two-user system in [9, Figure 4], which we have re-plotted as Fig. [2J Here, a new arrival for the first 
user has priority over a retransmission for the second user. No relays are present, which is equivalent 
to the RLPA problem if S a (n) = (0, 0) Va £ {1,2,..., M}. Assume that only relay a has decoded 
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the HoL packet for user 2, and relay a can decrease T 2t hol since |/i a ,2| 2 > \ht,2\ 2 - This increases user 
2's priority, and if |/i a ,2| 2 is above a threshold value, user 2 can obtain a higher priority index than 
user 1, and so a retransmission for user 2 has priority over a new arrival for user 1. In particular, 
7t~n(S(n), Si(n), S2(n), . . . , Sm (n)) = (2, a) as opposed to n(S(n)) = 1 for the LPA problem. 

Thus, the introduction of relays for the RLPA problem results in a modification to the optimal 
policy in [9]. Users are still sorted according to their priority indices and the highest priority user 
with a nonempty queue is scheduled. In this case, though, each relay a can inform the base station 
of its ability to improve the priority indices of some subset of the users by reporting S a (n) to the 
base. The base station can calculate an improved priority index for each user i such that R a ^{n) = 1. 
Then, all priority indices including any revised indices are sorted, and the highest priority user with 
a nonempty queue is scheduled along with the transmitter that yields that highest priority index. 

4 Relay- Assisted Draining Convex Problem 

Now we consider the relay-assisted draining convex (RDC) problem, which is a variant of the 
DC problem in [9]. The DC problem is a draining problem where no new packets arrive at the base 
station, and the base station wants to empty all of the user queues. The base station computes a 
cost function Ui for user i, where Ui is an arbitrary increasing function of the queue length Xi(n) and 
is independent of the number of transmission attempts of the HoL packet of user i. Thus, 

Ui(xi(n),rf oL (n)) = U^n)). 

The base station initially has a set of packets £2(1), • • • , ^jv(I))- The RDC problem entails 

determining the scheduling policy ttr G Hr that minimizes the total expected draining cost 

00 N 

£J>(**(n)) . 

n=l i=l 

As in Section [HI the optimal policy for the RDC problem is based on the fixed priority-index policy 
that is optimal for the DC problem [9]. For the RDC problem, knowledge of {Si(n), S 2 (n), . . . , Sm(^)} 
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at the base station is useful in deciding which users can be served more quickly than others. Note 
that as in the RLPA problem, cost increases with the incurred delay. 

Theorem 2. The optimal scheduling policy for the RDC problem is a priority-index rule, where the 
HoL packet with the highest priority index over all nonempty base station queues is selected. The 
transmitter that yields the highest priority index transmits the selected HoL packet. 

Proof. The proof is similar to that in Appendix so we provide a brief sketch of it as follows. As 
in Appendix |A], we transform the RDC problem into an instance of Klimov's multiclass queueing 
problem [10]. This implies that the transformed problem, which we refer to as the RDCK problem, 
has an optimal priority index policy. Finally, it can be shown that this policy is also optimal for the 
RDC problem. 

Based on Appendix lAl in the RDCK problem each user i has fQ = (M + 1)2^(1) (rj 710 * + 1) queues, 
and each queue is labeled /). A packet in (i,ri,Xi,l) has been transmitted times, has 

been decoded by relay d^i and has not been decoded by relay G^ m for I < m < M. 

Now, the objective is to find 7r# G 11^ that minimizes 



Jrdck — 



Yl 1 i,n,a:i,l( n ) U i( X i 

n=i (i,n,xi,t)eci 



where 

1 (i, rj, Xi, I) is nonempty in slot n 



otherwise. 



It follows from [9, Theorem 2] and [9, Lemma 2] that the optimal policy for the RDCK problem 
assigns queue (i,r i} x^m) higher priority than queue (i,Ti,Xi,l) for all i, ar», r[ > ri and m>l. 

Since the RDCK problem is a special case of Klimov's problem, the optimal policy for the RDCK 
problem is a priority-index rule. We then transform the RDCK problem back to the RDC problem to 
conclude that the optimal policy for the RDC problem is also a priority-index rule. The HoL packet 
with the highest priority index along with the transmitter that yields that index are selected over all 
nonempty queues at the base station. Note that unlike the RLPA problem, the priority indices for 
the RDC problem do not admit closed-form expressions. □ 
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The intuitive justification of Theorem [2] is similar to that of Theorem [TJ It should be noted that 
for the RLPA and RDC problems, a relay a with R ai {n) = 1 for some user i only increases the 
priority index of user i if | ht,i \ 2 < \h aj i\ 2 . 

5 Simulation Results 

Now we evaluate the performance of relaying in the RLPA problem. Fig. [3] displays the impact 
of employing M = 1 relay in a system with N = 2 users and arrival rates Ai = A2 = 0.3. The 
maximum number of retransmissions is r™ ax = r™ ax = 2, and the cost rates are C\ = [0.98 1 1.02] and 
c 2 = [1.25 1.5 1.75]. We model the effects of limited transmit-side channel knowledge by assuming 
that each channel ht t i and hij undergoes Rayleigh fading and varies independently between time 
slots. The base station only knows E(|/z tj j| 2 ) and E(|/iji| 2 ), while the relay only knows E( | /^i^ | 2 ) . 
Assuming the presence of only channel distribution information at each transmitter implies that the 
base station only knows its average channel parameters to users 1 and 2 as fji = ^ 2 = 0.9, and the 
relay only knows its average channel parameters to users 1 and 2 as fji i and f/ 12 , respectively. The 
base station computes the user probability of decoding failure, assuming that no relays assist it, as 



To characterize the performance impact of relaying, we vary ^ 1)2 and fix fj^i = 0.9, as this limits the 
ability of the relay to assist user 1 and allows us to focus on how the relay assists user 2. 

We run our simulation over B time slots, where at most one packet can be decoded in each time 
slot. If D packets are decoded during this simulation run, we define the throughput as D/B. We 
see that the long-term cost decreases and the throughput increases as the average channel gains 
from the relay to the base station and user 2 increase. In particular, the cost decreases by 17.3% 
and the throughput increases by 43.2% when ?7 lj2 increases from 0.1 to 0.9. This demonstrates the 
performance gains via intelligent relay deployment. Also, we see that the effects of limited channel 
knowledge at the base station and the relay are asymptotically negligible. 




.max 



(3) 
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Fig. H] shows how the optimal policy behaves as a function of the average base station channel 
gains to the users. We adopt the same parameters as in Fig. [3j except for 771,1 = 77^2 = 0.15, and we 
vary 772. 

We see that the throughput of the optimal policy deteriorates as the average base station channel 
gain to user 2 decreases. We also consider a case where no relay is present, and it can be seen that 
the throughput of the optimal policy decreases at an even faster rate than the case where M = 1 
relay assists the base station. This example further highlights the inherent challenges in a cellular 
network of servicing cell-edge users. 

6 Conclusion 

We have considered the problem of user scheduling in a downlink wireless system with HARQ 
retransmissions. By allowing fixed relays to assist the base station or access point in servicing a 
scheduled user, a cost function of the user queue lengths at the base station and the number of 
retransmissions of the HoL packet for each user can be minimized. We have proved that the optimal 
scheduler for the relay-assisted extensions of two problems in [9] is a priority-index rule. 

The main contribution of this work opens up several avenues for further investigation of the 
relay-assisted scheduling problem. In particular, it is clear that relay-assisted scheduling is actually 
a relay selection problem, and extensive prior work on relay selection has conclusively shown its 
difficult cross-layer nature. Thus, a more comprehensive approach to this problem would consider 
additional factors such as the specific type of HARQ being employed at the relays and the users 
along with more general packet arrival processes at the base station. For example, if one user is 
downloading multimedia content while another is sending text messages, this could be used to design 
an appropriate cost function for each user. Also, the performance impact of real-world issues such as 
timing mismatch between the base station and any of the relays in its network should be evaluated. 
For example, if a delay of one time slot occurs before a selected relay receives its selection notification 
from the base station, then it will transmit and cause a packet collision with another selected relay 
or the base station, which could degrade the achieved throughput. 
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A Proof of Theorem ffl 

As in [9], we transform the RLPA problem into an instance of the multiclass queueing problem 
of Klimov [10]. Thus, the transformed problem, which we refer to as the RLPAK problem, has an 
optimal priority index policy. We then show that this policy is also optimal for the RLPA problem. 

Note that for each user i, the M relays are sorted as {e^i, <i i)2 , . . . , <2i,Af } ; where Ih^ lt i\ 2 < {h^ 2ji | 2 < 
■ • • < \h di Mi j| 2 . Then, each user i has (M + l)(r[ na:E + 1) queues, and each queue is labeled as (i, r i: I). 
A packet in (i,ri,l) has been transmitted r« times, has been decoded by relay dij, and has not been 
decoded by relay c?j m for I < m < M. There are a total of K = £- =1 (M + l)(r™ ax + 1) queues. If 
A = J2f=i ^ii eacn arriving packet is assigned to (i, 0, 0) with probability Pi >0 = Aj/A, and (i, r i: I) has 
a deterministic service time of b iir .j = 1 time slot. The cost of storing a packet in queue (z,rj,Z) is 
Ci >rit i and the number of packets in queue (z,rj,Z) at the beginning of the nth time slot is x^ ru i(n). 
Fig. [5] shows an example of the RLPAK problem for user 1 where r™ ax = 2. 

The queue transition probabilities in the RLPAK problem are determined as follows. Let gi t i t k(fi) 
denote the probability that relay d^i cannot decode the HoL packet of user i after its transmission 
attempt by relay d^k- Also, let gijfa, 1) denote the probability that user i cannot decode its HoL 
packet after its transmission attempt rj by relay d^i. Then 

P(i,r i ,o),(i ) r i +i,o) = 9i(ri)gi,i,o(ri)gi,2,o(ri) ■ ■ -ft,M,o(n)> 

P(i,n,o),{i,n+u) = 9i(ri){l ~ 9i,l,o( r i))9i,l+i,o( r i)9i,i+2,o(ri) • ' ■ 9i,M,o( r i) 

P(i,n,l),(i,ri+l,n) = 9i,li. r U - 9i,nA r i))9i,n+l,l( r i)9i,n+2,l(ri) ■ ■ ■ gi,M,l{ r i) > n > l > 

P(i,n,i),(i, n +i,i) = 9i,i( r ^ l )9i,i+iA r i)9i,i+2,i{ r i) " " ' 9i,M,i( r i)i 

P(i, ri ,l),(i,ri+l,n) = 0, 71 < I 

and so the packet departs the system from (i, rj, 0), (i, fj, /) and (i, r 4 maa: , I) with probabilities 1 — gi(ri), 
1 — 9i,i( r i> 1) anc ^ 1 respectively where / G {0, 1, . . . , M}. 

Thus, for any Acfi = {l,2,..., K} and any (z, r i; I) G A, the average total service time is 

rp{A) _ 1 ry-r(A) 
i i,n,; ~~ P(«,n,0.( fc . r fe. m ) i fc,r fcl m- 
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From the above discussion, it can be concluded that the RLPAK problem, which is a transformed 
version of the RLPA problem, is an instance of the multiclass queueing problem of [10]. This conclu- 
sion also relies on the simple queueing dynamics of the relay: 1) a packet only arrives at the relay if 
the base station has transmitted it, and 2) the relay automatically flushes a packet once it has been 
decoded by its intended user. In addition, the base station automatically flushes a packet once it has 
been decoded by its intended user. It should be noted that the state space of the RLPAK problem is 
an expanded version of that in the LPAK problem. 

Now, the objective is to find 7r# G 11^ that minimizes 



J rlpak — h m —^tt r 



To this end, we state the following result. 



C i,ri,l X i,ri,l( n ) 

n=i (i,n,i)en 



Lemma 1. Let A^, k = 1, 2, . . . , K be the sets of queues generated by the Klimov algorithm in [9, 
Section 3] for the LPAK problem. For each k = 1, 2, . . . , K and for all (i, r^, I) G A^: 
1 ) (i, r^m) G Ak for all r\ > and for all m > I. 

^^=i+E;;:- i nL,.( S )=e. 

3) = 1 + LSI' 1 UU 9i,m(s, 1) = T^ m , m > 0. 

1) T { y < T^ k ] for all r- > r { and for all m > I. 

i,r it m ' 

5) a k = argmin (iirii0eAfc (c iin>i /T.^). 

Proof. This result follows in a straightforward manner from [9, Lemma 1]. □ 

By combining [9, Theorem 1] and Lemma [H it follows that the optimal scheduling policy for the 
RLPAK problem is a priority- index rule where the priorities ai, a.2, ■ ■ ■ , ctx satisfy 



T (n) - T (n) - ■ ■ ■ - (fi) • 

-*- Oil J- OL2 OIK 



Since the optimal scheduling policy for the RLPAK problem is a priority-index rule, we employ [9, 
Corollary 1] to conclude the the optimal scheduling policy for the RLPA problem is also a priority- 
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index rule. The HoL packet with the highest priority index of c { t h l IT- h l along with the transmitter 

5 i ' i 

that yields that index are selected over all nonempty queues at the base station. 
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Figure 1: Wireless network with relay-assisted scheduling. 




Figure 2: Optimal priority orders versus holding-cost rate of user 2 in the LPAK problem. 



%8- 
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Figure 3: Long-term average expected cost and throughput for RLPA problem as function of average 
channel from relay to user 2. 
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Figure 4: Throughput for RLPA problem as function of average channel from base station to user 2. 
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Figure 5: System model for RLPAK problem with queues for user 1. 



14 



